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(54) Method and apparatus for calculating a pixel block from one or two prediction blocks 



(57) A method is disclosed for calculating the pixel 
values of a sub-pixel accuracy motion compensated 
block of video pixels from one or two reference blocks, 
as is required in typical digital video compression and 
decompression systems. This method uses a minimal 
amount of temporary storage memory resulting in a 
compact architecture suited for inexpensive consumer 
applications. This invention utilizes a pixel pipeline 
within a block line pipeline to calculate the half pel accu- 
rate reference blocks and to average two blocks to result 
in a prediction block of pixels. The lines from each refer- 
ence block are input to the invention alternately result- 
ing in reduced memory requirements. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention s 

The present invention relates to a digital video 
motion compensation method and apparatus used to 
calculate prediction blocks to half pel precision using 
both the forward and backward directions using minimal 10 
temporary storage requirements. This is especially 
applicable in digital video compression and decompres- 
sion systems. 

2. Description of the Prior Art is 

Many digital video processing systems utilize 
motion compensation techniques in which a previously 
processed picture or portion thereof is used for further 
processing of another picture. A typical example is the 20 
motion compensation techniques used by digital video 
compression and decompression processes in which 
groups of pixels from one picture are used to predict 
groups of pixels in other pictures. For example, MPEG- 
1 (see "CD11172 - Coding of Moving Pictures and 25 
Associated Audio for Digital Storage Media at up to 
about 1.5 Mbps" by International Organization for 
Standardization, ISO MPEG Document, ISO- 
IEC/JTC1/SC2/WG8, 1992) and MPEG-2 (see 
"IS13818 - Generic Coding of Moving Pictures and 30 
Associated Audio" by International Organization for 
Standardization, ISO MPEG Document, ISO- 
IEC/JTC1/SC2/WG1 1 , 1994) utilize bidirectional half 
pel accuracy motion compensation. This entails dividing 
a picture up into groups of pixels called macroblocks 35 
which are predicted individually using motion vectors 
which refer to previously processed pictures located 
temporally before and after the picture being processed. 
The motion vectors supported by these standards are 
half-pixel precision which means that the prediction 40 
macroblocks pixels may have to be calculated as the 
average of two or four pixels. In addition, MPEG-2 sup- 
ports several modes including field/frame motion com- 
pensation in which a macroblock can be predicted using 
previous frame or previous field data. 45 

Fig. 2 shows how half-pixel precision is used to 
generate a prediction block. In this figure, the white cir- 
cles represent full pixel precision pixels, the grey circles 
represent half pixel precision pixels calculated as the 
average of two white pixels in either the horizontal or so 
vertical direction, and the black circles represent half 
pixel precision pixels calculated as the average of four 
white pixels. Fig. 3 shows how a block can be predicted 
using frame motion compensation by averaging two 
blocks from two reference pictures. The white ovals rep- 55 
resent the pixels of the two references and the grey 
ovals represent the resulting predicted pixels calculated 
by averaging the white pixels. Note that each reference 
block may or may not have been determined using half 



pixel accuracy interpolation. Fig. 4 shows how a block 
can be predicted using field motion compensation by 
combining individual fields. Each frame comprises of 
two fields which can be separated, as shown by the 
white and light grey circles representing pixels of two 
fields of a picture frame. The reference fields can then 
be combined individually by averaging, and then the 
resulting prediction fields combined to make the pre- 
dicted block. Again, note that each reference block 
(field) may or may not have been determined using half 
pixel accuracy interpolation. 

The most obvious architecture to do this construc- 
tion of a prediction block is shown by Fig. 5. In this fig- 
ure, numeral 501 is the picture memory, 502 is the read 
block memory for reference 1, 503 is the read block 
memory for reference 2, 504 is the half-pixel interpola- 
tor, 505 is the reference block memory for reference 1, 
506 is the reference block memory for reference 2, 507 
is the reference averager, 508 is the prediction block 
memory, 509 is the IDCT (inverse discrete cosine trans- 
form) processor, 510 is the difference block memory, 
511 is the reconstructed and 512 is the decoded block 
memory. The full pixel reference blocks are read from 
the picture memory 501 into read block memories 502 
and 503. The half-pixel interpolation is performed on 
each block by the half-pixel interpolator 504 and the 
resulting block is written into a reference block memory 
505 and 506. Following this, the two references are 
combined by the reference averager 507 with the result 
being written into a prediction block memory 508. This 
prediction block is then combined using the recon- 
structer 51 1 with the contents of the difference block 
memory 510 and stored in the decoded block memory 
512. Finally, the decoded block is written to the picture 
memory 501 . The difference block memory 510 is filled 
by other decoding processes, in this case the IDCT 
processor 509. For MPEG1 and MPEG2, this is the 
inverse discrete cosine transform (IDCT) process. 

The VDSP2 (see "Video DSP Architecture for 
MPEG2 CODEC", Araki et al., Proceedings of ICASSP- 
4, April 1994) reduces the memory requirements by 
performing the half pel interpolation as the reference 
block data is read with the result being placed in the ref- 
erence block memory. It thus replaces the read block 
memories with a half-pel interpolation calculator, tt also 
eliminates the decoded block memory by combining the 
difference block with the prediction block as the data is 
written to the picture memory. The block diagram show- 
ing the simplified VDSP2 architecture for this procedure 
is shown in Fig. 6. In this figure, numeral 601 is the pic- 
ture memory, 602 is the half -pixel interpolator, 603 is the 
reference block memory for reference 1 , 604 is the ref- 
erence block memory tor reference 2, 605 is the refer- 
ence averager, 606 is the prediction block memory, 607 
is the IDCT processor, 608 is the difference block mem- 
ory, and 609 is the reconstructed The full pixel reference 
blocks are read from the picture memory 501 and the 
half-pixel interpolation is performed on each block by 
the half-pixel interpolator 602 and the resulting block is 
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written into a reference block memory 603 and 604. Fol- 
lowing this, the two references are combined by the ref- 
erence averager 605 with the result being written into a 
prediction block memory 606. This prediction block is 
then combined using the reconstructer 609 with the 
contents of the difference block memory 608 and written 
to the picture memory 601. The difference block mem- 
ory 608 is filled by other decoding processes, in this 
case the I DCT processor 607. 

3. Problems that the Invention is to Solve 

An object of the invention is to reduce the amount of 
temporary storage required for reconstructing the 
decoded pixel data using motion compensated refer- 
ences. Digital video processors such as the MPEQ1 
and MPEG2 codecs are used in consumer electronic 
applications, so the cost of implementation is very 
important. The size of the implementation is directly 
proportional to cost in many cases. An object of the 
invention is to reduce the size of temporary storage 
required in order to reduce the cost of the implementa- 
tion. 

4. Means of Solving the Problems 

For the purpose of solving the above-described 
problem, the present method for calculating the pixel 
values of a sub-pixel accuracy motion compensated 
block of video pixels from one or two reference blocks 
was invented. The invention comprises of transferring 
said reference block pixels to a calculator in an order in 
which the adjacent block lines of each block are consec- 
utive, alternating between each reference block with 
each block line, and ordering the block lines of both 
blocks in the same relative order, and in which the adja- 
cent pixels within each block line are consecutive. While 
this is occurring, said calculator calculates the predicted 
block and outputs the result as the data is calculated 
ordering the adjacent block lines of said predicted block 
consecutively in the same relative order as said refer- 
ence read order for one of the reference blocks, wherein 
the adjacent pixels within each block line are ordered 
consecutively in the same relative order as said refer- 
ence read order for one of the reference blocks. The cal- 
culator comprises of an x-interpoiator for performing the 
sub-pixel interpolation between pixels of the same block 
line; two reference block line buffers for temporarily stor- 
ing the block line from each reference block; a y-interpo- 
lator for performing the sub-pixel interpolation between 
pixels of consecutive block line; an interpolation refer- 
ence block line buffer for temporarily storing the sub- 
pixel accuracy reference block line from the reference 
for which the first data originated; and a reference block 
averager for averaging the block line stored in said inter- 
polation reference block line buffer with the interpolated 
reference block line of the subsequent reference block. 



5. Operation of the Invention 

The pixel data flow of the invention operates as a 
pixel pipeline within a block line pipeline. The data is 
5 read from the reference picture memories in the order 
that the calculations require the data. In this way, the 
pixels of a line of the block are interpolated using a sin- 
gle tap averaging filter as they are read and stored into 
the reference block line buffers. The line buffers act as 
10 the delay buffer in a 1 tap averaging filter for the block 
lines, in order to calculate the half pel interpolated block 
lines for each reference. The y-interpolator implements 
the computations for the averaging filter, with the result 
being a block line that has been averaged in two direc- 
ts tions, which is the half pel accurate block line for the first 
reference. It is stored in the interpolation reference 
block line buffer. The next line to pass through the y- 
interpolator is from the second reference, if two refer- 
ences are used. This data corresponds to the half pel 
20 accurate block line for the second reference. The results 
of this calculation is sent to the reference block averager 
along with the pixel values corresponding to the half pel 
accurate block line for the first reference which are 
stored in the interpolation reference block line buffer. 
25 The result from the reference block averager is the bidi- 
rectional half pel accurate prediction block line which 
can be combined with the output of the IDCT and sent 
to the picture memory as a fully decoded block line. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing one embodiment 
of the method according to the present invention, 
Fig. 2 is a diagram showing how half-pixel precision 

35 is used to generate a prediction block, 

Fig. 3 is a diagram showing how a block can be pre- 
dicted using frame motion compensation by aver- 
aging two blocks from two reference pictures, 
Fig. 4 is a diagram showing how a block can be pre- 

40 dieted using field motion compensation by combin- 
ing individual fields, 

Fig. 5 is a block diagram showing one architecture 
for constructing the half-pixel precision prediction 
block from two references, 
45 Fig. 6 shows the architecture used by the VDSP2 
. for constructing the half pixel precision prediction 
block from two references, 
Fig. 7 describes an embodiment of the invented 
method, 

so Fig. 8 describes an example of the input and output 
pixel timing necessary for the operation of the 
invented method and 

Fig. 9 describes the timing for the embodiment of 
Fig. 7 if the example input and output timing of Fig. 
55 8 is used. 
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DETAILED DESCRIPTION OF THE PREFERRED 

E MBODIMENTS 

With reference to drawings, an embodiment for cal- 
culating the pixel values of a sub-pixel accuracy motion 
compensated block of video pixels from one or two ref- 
erence blocks and combining it with pixel difference 
block values to result in decoded pixel values will be 
described. 

Fig. 1 shows an apparatus for calculating the pixel 
values of a block from one or two prediction blocks 
according to the present invention. In Fig. 1, numeral 
101 is an input for the input pixels, 102 is an x-interpola- 
tor, 103 is an output of the x-interpolator 102, 104 is a 
reference 1 block line buffer, 105 is a reference 2 block 
line buffer, 106 is a bypass output of the x-interpolator, 
107 is an input of the y-interpolator, 108 is a y-interpola- 
tor, 109 is an output of the y-interpolator, 110 is a 
bypass output of the y-interpolator, 1 1 1 is an interpola- 
tion reference block line buffer, 112 is a reference block 
averager, 113 is an output of the reference block aver- 
ager, 1 14 is an IDCT processing block, 1 1 5 is an output 
of the IDCT processing block, 116 is a reconstructed 
and 1 1 7 is an output for the decoded pixels after being 
processed by the invented method. 

The embodiment of Fig. 1 operates as follows. The 
input pixels 101 are applied to an x-interpolator 102 
which functions as a filter for calculating an interpolated 
pixel value using pixel values of the same reference 
block line. The resulting interpolated pixel values output 
from the x-interpolator 103 are stored in either the refer- 
ence 1 block line buffer 104 or the reference 2 block line 
buffer 105 depending upon which reference the original 
pixels came from. These line buffers store the pixels 
corresponding to the x-interpolated input pixels for the 
reference blocks. The next line from the same reference 
is input to the input 101 and x-interpolated by the x- 
interpolator 102. As the results are output from the x- 
interpolator 103, they are simultaneously written to the 
respective reference block line buffer 104 or 105, and 
bypassed to the y-interpolator 108 using the bypass 
output of the x-interpolator 106. Simultaneously, the 
previous lines corresponding pixel value is read from 
the reference block line buffer and output 107 to the y- 
interpolator 108. In this way, the output 107 from the ref- 
erence block line buffer 104 or 105 is applied to the y- 
interpolator 1 08 at the same time as the output from the 
x-interpolator 103 corresponding to the next reference 
block line is applied to the bypass output of the x-inter- 
polator 106, and thus is input to the y-interpolator 108. 
The y-interpolator 108 functions as an interpolator for 
averaging the pixel values for a reference block line with 
the corresponding pixel values for the reference block 
line immediately before it. The output of the y-interpoia- 
tor 109 is written into the interpolation reference block 
line buffer 111. It contains the result of the reference 1 
half pixel accuracy reference block line as calculated by 
the use of the x-interpolator 102, the reference 1 block 
line buffer 104 and the y-interpolator 108. In a similar 



fashion, the reference 2 half pixel accuracy reference 
block line is calculated by the use of trie x-interpolator 
102, the reference 2 block line buffer 105 and the y- 
interpolator 108. When the output of the y-interpolator 

5 109 is outputting the half pixel accuracy reference 2 
block line, it is not written to the interpolation reference 
block line buffer 1 1 1 , but instead this data is passed to 
the reference block averager 112 using the bypass out- 
put of the y-interpolator 1 10. Simultaneously, the corre- 

10 spending pixel stored in the interpolation reference 
block line buffer 1 1 1 is passed to the reference block 
averager 112. The reference block averager 112 calcu- 
lates the average of these pixel values and applies it's 
output 113 to the reconstructer 1 16. Simultaneously, the 

is IDCT 114 applies it's corresponding pixel data to the 
reconstructer 116, with the output being the decoded 
pixel output 117. 

Fig. 8 describes an example of the input and output 
timing used by this embodiment to result in the required 

so decoded pixel output data. This example shows two ref- 
erence blocks where each is required to be half-pixel 
interpolated in both the horizontal and vertical directions 
before being averaged to form the decoded pixel block 
data, the desired half pixel values are located in the 

25 spaces between the circles representing the pixels. First 
the pixels of the top line of the reference block 1 are 
input, followed by the top line of the reference block 2. 
followed by the second line of reference block 1 , and so 
on. Each line is input from the leftmost pixel of the refer- 

30 ence block to the rightmost. The resulting decoded pixel 
data output order is a similar left-to-right, top-to bottom 
. order. 

As a further example, it is possible to apply the tim- 
ing of Fig. 8 to the embodiment of Fig. 1 , as an example 

35 of it's operation. The pixel 1 through 4 are applied suc- 
cessively to the x-interpolator 102, with the result being 
the average of 1 and 2, 2 and 3, and 3 and 4 written into 
the reference 1 block line buffer 104. Following this, the 
input pixels 5 through 8 are applied successively to the 

40 x-interpolator 1 02, with the result being the average of 5 
and 6, 6 and 7, and 7 and 8 written into the reference 2 
block line buffer 105. Next, the input and averaging of 
the 9 through 1 2 pixels are performed. The output from 
the x-interpolator for the average of 9 and 1 0 are applied 

45 to the y-interpolator 1 08 via the bypass output of the x- 
interpolator 106 at the same time as the average of pix- 
els 1 and 2 are read from the reference 1 block line 
buffer 104 and applied to the y-interpolator 108. The y- 
interpolator 108 then calculates the average of these 

so inputs resulting in the average of pixels 1, 2, 9 and 10. 
This corresponds to the half-pixel accuracy pixel values 
for reference block 1 . The result of this calculation is 
written into the interpolation reference block line buffer 
111 for each resulting calculated pixel value. Similarly 

55 the values of the half-pixel accuracy pixel values for ref- 
erence block 2 are calculated, but in this case, as they 
are calculated they are not written into the interpolation 
reference block line buffer 111. Instead they bypass is 
using the bypass output of the y-interpolator 1 10. In this 
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way the contents of the interpolation reference block 
line buffer 1 1 1 can be averaged with the output of the y- 
interpoiator 108. Thus, the average of pixels 1 , 2, 9 and 
10 can be averaged by the reference block averager 
1 12 with the average of pixels 5, 6, 13 and 14 resulting 
in a half pixel accurate prediction pixel formed from two 
references. This result can be added to the correspond- 
ing pixel from the IDCT processor 114 using the recon- 
structer 116. This results in the decoded pixel number 1 
in the decoded pixel block shown in Fig. 8. Similarly, all 
the pixels of this block can be calculated. 

Fig. 7 describes another embodiment for the 
invented method. In this figure, numeral 701 is an input 
for the reference block pixels, 702 is a pixel buffer 
bypass, 703 is a pixel buffer, 704 is an x-leftshifter input, 
705 is an x-adder, 706 is an x-leftshifter, 707 is a x-mul- 
tiplexer, 708 is a reference block line buffer bypass, 709 
is a reference block line buffer 1, 710 is a reference 
block line buffer 2, 71 1 is a y-adder 71 2 is a y-leftshifter, 
713 is a y-multiplexer, 714 is an xy-adder, 715 is an xy- 
rightshifter, 716 is an interpolation block line buffer, 717 
is an interpolation block line buffer bypass, 718 is a ref- 
erence adder, 719 is a reference-rightshifter, 720 is an 
IDCT processor, 721 is a reconstructer adder, and 722 
is a decoded pixel output. 

This embodiment can be implemented as a syn- 
chronous machine using latches clocked with a com- 
mon clock and combinational logic for processing and 
control between the latches. For this embodiment, the 
reference block pixels are input to the input 701 using 
the same cycle as the clocks for the latches. The pixel 
buffer 703 can be implemented as a latch. The refer- 
ence block line buffers 709 and 710, and the interpola- 
tion block line buffer 716 are implemented as a group of 
N latches, where N is the number of pixels in each block 
line. The other blocks in the diagram can be imple- 
mented using combinational logic. However the IDCT 
may be implemented using a synchronous circuit. For 
this embodiment the x-adder 705 is a simple adder, the 
x-leftshifter is a bit shifter for multiplying the binary pixel 
value by 2. The combination of pixel buffer 703, pixel 
buffer bypass 702 and x-adder 705 comprise an averag- 
ing circuit for horizontal averaging of the input pixel data. 
The output is equal to twice the average of the pixels 
averaged. The x-multiplexer 707 is used to select the 
output of the x-adder 705 or the output of the x-left- 
shifter 706 based on the half-pel-x value. The half-pel-x 
can be either 0, indicating that no half-pixel precision 
interpolation is required in the x-direction (horizontal), or 
1 indicating that interpolation is required. The output of 
the x-multiplexer 707 is stored in the reference block line 
buffers 709 and 710. These buffers comprise of latches 
which act as synchronous FIFOs. The outputs of the x- 
multiplexer 707 which originates from input reference 
block 1 are written to Reference block line buffer 1 709 
while the outputs which originate from input reference 
block 2 are written to Reference block line buffer 2 710. 
The y-adder 71 1 is a simple adder, and the y-leftshifter 
712 is a bit shifter for multiplying the pixel value by 2. 



The combination of reference block line buffer 709 or 
710, reference block line buffer bypass 708, and y- 
adder 71 1 comprise an averaging circuit for vertical 
averaging of the input pixel data. The output is equal to 

5 twice the average of the pixels averaged. The y-multi- 
plexer 713 is used to select the output of the y-adder 
711 or the output of the y-leftshifter 712 based on the 
half-pel-y value. The half-pel-x can be either 0, indicat- 
ing that no half-pixel precision interpolation is required 

w in the y-direction (vertical), or 1 indicating that interpola- 
tion is required. The output from the y-multiplexer 713 is 
applied to the xy-adder 714, which simply adds 2 to the 
input. This result is shifted 2 bits to the right by the xy- 
rightshifter 715. The combination of the xy-adder and 

15 xy-rightshifter results in a divide by 4 with rounding up to 
the nearest integer result. This result is the half-pixel 
precision reference pixel from reference 1 or 2. If this 
result comes from reference 1 pixels, it is written into the 
interpolation block line buffer 716. If it is from reference 

20 2, it is applied to the input of the reference adder 718, 
which adds it's inputs plus an additional 1. In other 
words, the half pixel accurate prediction pixel for refer- 
ence 2 is added to the corresponding half pixel accurate 
prediction pixel for reference 1 which is stored in the 

25 interpolation block line buffer 716. The output of the ref- 
erence adder 71 8 is shifted right by 1 position by the ref- 
erence rightshifter 719. The combination of the 
reference adder, 718 and the reference rightshifter 719 
is a circuit for dividing the sum of the two inputs by 2 with 

30 rounding up to the nearest integer. The output is there- 
fore the result of averaging the half pixel accurate pre- 
diction pixels from both reference 1 and 2. The 
corresponding difference pixel value is generated by the 
IDCT processor 720 and added to the output of the ref- 

35 erence rightshifter 719 using the reconstructer adder 
721. The result is a decoded pixel value at the output 
720. 

This embodiment can be, but is not limited for use in 
a video processing system where the reference block 

40 pixels which are input 701 come from a picture memory, 
and the resulting decoded pixels output 722 are written 
to the same memory. 

An example of the input and output timing for the 
pixel data is shown in Fig. 8. The resulting timing for this 

45 embodiment using this pixel data timing is shown by Fig. 
9. For this example, the picture memory is read one 
clock before the reference pixel input. This would corre- 
spond to a latch between the picture memory and the 
reference pixel input 701 . The numbers in Fig. 9 refer to 

so pixels as described in Fig. 8. From Fig. 9 it can be seen 
that the input to the reference block line buffer 1 consists 
of the sum of pixels 1 and 2 when the pixel 2 value is 
applied to the reference pixel input 701. Similarly, the 
sums of pixels 2 and 3, 3 and 4. 5 and 6, 6 and 7, and 8 

55 and 9 are calculated and applied to the inputs of the ref- 
erence block line buffers 1 and 2 709 and 71 0. The input 
to the interpolation block line buffer 71 6 is the sum of the 
reference pixel input, x-interpolator output, and refer- 
ence block line buffer 1 input, all divided by four. This 
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corresponds to the reference 1 half pixel accurate val- 
ues. These are averaged with the reference 2 half pixel 
accurate values, and added to the IDCT difference val- 
ues to result in the decoded pixel values called x, y and 
2 in Fig. 9. These can be written to the same picture 
memory as the input data originates from by delaying it 
slightly so that after pixel 16 is read from memory, data 
z can be written. This additional delay on the decoded 
delay is easily implemented using latches. 

Note that it is possible to change the timing of Fig. 
9. The timing of the memory read and write timing can 
be changed, and additional latches can be added to the 
embodiment of Fig. 7 in order to reduce the amount of 
combinational logic that signals must propagate through 
between latches. In addition, the embodiment describes 
the x-direction to be synonymous with horizontal, and y- 
direction as synonymous with vertical, but these can be 
reversed. The results will be similar. In addition, the use 
of half-pixel accuracy interpolation in the embodiments 
can be changed to quarter-pixel or better accuracy by 
increasing the number of taps on the x and y interpola- 
tors. 

The number of pixels in each line and the number of 
lines in each block of pixels is not limited to the exam- 
ples described by these embodiments. The example of 
the timing described by Fig. 8 and 9 show how half pixel 
interpolation can be done in both the horizontal and ver- 
tical directions. The embodiments described can also 
be used when full pixel accuracy is required for horizon- 
tal, or vertical or neither directions. This was described 
in the embodiment for the use of the x-leftshifter and y- 
leftshifter. The number of reference pixels in a block ref- 
erence depends upon the need for half pixel interpola- 
tion. If half pixel interpolation is required, the reference 
block must have one extra pixel in the direction of inter- 
potation, as in the example described by Fig. 8. More 
pixels are needed in the case of higher precision inter- 
polation. In this example, a 4x4 pixel reference block is 
interpolated to make a 3x3 prediction block. 

The advantage of this invention is to reduce the 
required temporary storage memories to as few as 3 
block line memories (two reference block line buffers 
and the interpolation reference block line buffer) plus a 
pixel buffer inside the x-interpolator. This is much less 
than the temporary memory storage requirements of 
the examples described in the prior art. The result will 
be much cheaper implementations of the motion com- 
pensation circuitry used in consumer electronics prod- 
ucts which require this function. The reason the 
memory can be reduced so much is due to the use of a 
pipeline within a pipeline which allows the calculation of 
x-interpolated half pel precision values to be performed 
at the same time as y-irrterpolated half pel precision val- 
ues, which can be performed at the same time as the 
two references are interpolated. This invention mini- 
mizes the time for which a pixel must be delayed in 
order to calculate the prediction pixels, thus eliminating 
the need for additional memory storage devices. 

Although the present invention has been fully 



described in connection with the preferred embodi- 
ments thereof with reference to the accompanying 
drawings, it is to be noted that various changes and 
modifications are apparent to those skilled in the art. 
5 Such changes and modifications are to be understood 
as included within the scope of the present invention as 
defined by the appended claims unless they depart 
therefrom. 

10 Claims 

1 . A method for calculating pixel values of a sub-pixel 
accuracy motion compensated block of video pixels 
from one or two reference blocks and combining it 

15 with pixel difference block values to result in 
decoded pixel values comprising the steps of: 

transferring said reference block pixels in a ref- 
erence read order to a calculator; 
20 calculating the predicted block using said cal- 

culator; 

outputting said predicted block in a predicted 
block write order to a reconstructed and 
combining said predicted block with said pixel 
25 difference block using said reconstructed 

2. A method according to claim 1 . wherein said refer- 
ence read order comprises the steps of: 

30 ordering the adjacent block lines of each block 

consecutively, alternating between each refer- 
ence block with each block line, and ordering 
the block lines of both blocks in the same rela- 
tive order; 

35 ordering the adjacent pixels within each block 

line consecutively, and ordering the pixels of 
both blocks in the same relative order. 

3. A method according to claim 1, wherein said pre- 
40 dieted block write order comprises the steps of: 

ordering the adjacent block lines of said pre- 
dicted block consecutively in the same relative 
order as said reference read order for one of 
45 the reference blocks; and 

ordering the adjacent pixels within each block 
line consecutively in the same relative order as 
said reference read order for one of the refer- 
ence blocks. 

50 

4. A method according to claim 1 wherein said calcu- 
lator comprises: 

an x-interpolator for performing the sub-pixel 
55 interpolation between pixels of the same block 

time; 

two reference block line buffers for temporarily 
storing the block line from each reference 
block; 
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a y-interpolator for performing the sub-pixel 
interpolation between pixels of consecutive 
block lines; 

an interpolation reference block line buffer for 
temporarily storing the sub-pixel accuracy ref- 
erence block line from the reference for which 
the first data originated; 
a reference block averager for averaging the 
block line stored in said interpolation reference 
block line buffer with the interpolated reference 
block line of the subsequent reference block. 

5. A method according to claim 4, wherein said x- 
interpolator comprises: 

an x-pixel buffer for temporarily storing and 
delaying the input pixel data; 
an x-adder for adding the input pixel data with 
the delayed pixel data from said x-pixel buffer; 
an x-multiplier for multiplying the input pixel 
data by two; and 

an x-multiplexer for selecting the result of said 
x-interpolator from either the output of said x- 
adder when sub-pixel interpolation between 
pixels of the same block line is required, or the 
output of said x-multiplier when sub-pixel inter- 
polation between pixels of the same block line 
is not required. 

6. A method according to claim 5, wherein said x-mul- 
tiplier comprises: 

a shifter for shifting said input pixel data one bit 
position left, resulting in a multiplication by 2. 

7. A method according to claim 4, wherein said refer- 
ence block line buffer comprises: 

the same number of pixel buffers as there are 
pixels in a block line. 

8. A method according to claim 4, wherein said y- 
interpolator comprises: 



a y-adder for adding the input block line pixel 45 
data with the delayed block line pixel data from 
said reference block line buffer, wherein the 
source of the y-adder inputs originate from the 
same reference block; 

a y-multiplier for multiplying the input block line so 
pixel data by two; 

a y-multiplexer for selecting the result of said y- 
interpolator from the output of said y-adder 
when sub-pixel interpolation between pixels of 
consecutive block lines is required, and from 55 
the output of said y-multiplier when sub-pixel 
interpolation between pixels of consecutive 
block lines is not required; and 
a y-divider for dividing the output from the y- 



2adder by 4, resulting in the output of said y- 
interpolator. 

9. A method according to claim 8, wherein said y-mul- 
5 tiplier comprises: 

a shifter for shifting said input block line pixel 
data one bit position left, resulting in a multipli- 
cation by 2. 



10. A method according to claim 8, wherein said y- 
divider comprises: 

a y-2adder for adding two to the output of the y- 
multiplexer; and 

a y-4divider for dividing the output from the y- 
2adder by 4 with truncation. 



11. A method according to claim 10, wherein said y- 
20 4divider comprises: 

a shifter for shifting said output from the y- 
2adder right by 2 resulting in a division by 4 
with truncation. 



30 



12. A method according to claim 4, wherein said inter- 
polation reference block line buffer comprises: 

the same number of pixel buffers as there are 
pixels in a block line. 

13. A method according to claim 4, wherein said refer- 
ence block averager comprises: 



35 



an averaging adder for adding one to the sum 
of the input to said interpolation reference block 
line buffer and the delayed output of said inter- 
polation reference block line buffer; and 
an averaging divider for dividing the output of 
40 said averaging adder by two with truncation. 



14. A method according to claim 13, wherein said aver- 
aging divider comprises: 

a shifter for shifting said output of said averag- 
ing adder right by 1 resulting in a division by 2 
with truncation. 

15. A method according to claim 1, wherein said pixel 
difference block comprises: 

data resulting from an inverse discrete cosine 
transform operation. 

16. A method according to claim 1 , wherein said recon- 
structer comprises: 

an adder. 
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17. An apparatus for calculating pixel values of a sub- 
pixel accuracy motion compensated block of video 
pixels from one or two reference blocks and com- 
bining it with pixel difference block values to result 
in decoded pixel values comprising: 5 

means for transferring said reference block pix- 
els in a reference read order to a calculator; 
means for calculating the predicted block using 
said calculator; 

means for outputting said predicted block in a 
predicted block write order to a reconstructed 
and 

means for combining said predicted block with 
said pixel difference block using said recon- 
structed 
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